Search CORE

181 research outputs found

ASTRID: Accurate Species TRees from Internode Distances

Author: A Criscuolo
BR Larget
D Bryant
DF Robinson
ED Jarvis
G Dasarathy
I Gronau
J Chifman
J Heled
J Sukumaran
JFC Kingman
JH Degnan
JP Gatesy
L Kubatko
L Liu
L Liu
L Liu
L Liu
L Liu
L Nakhleh
LL Knowles
MN Price
MS Bayzid
MS Bayzid
MS Bayzid
N Saitou
Pranjal Vachaspati
R Desper
S Mirarab
S Mirarab
S Mirarab
S Mirarab
S Mirarab
S Roch
S Roch
S Roch
S Song
S Song
T Warnow
Tandy Warnow
W Maddison
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

A format for phylogenetic placements

Author: A Kluge
A Monier
Aaron Gallagher
Alexandros Stamatakis
C Von Mering
D Crockford
F Matsen
F Matsen
Frederick A. Matsen
J Caporaso
J Felsenstein
Jonathan H. Badger
M Pirrung
M Stark
M Wu
Noah G. Hoffman
O Westesson
S Berger
S Berger
S Evans
S Mirarab
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 16/01/2012
Field of study

We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g. short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements, and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.Comment: Documents version 3 of the forma

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Identifying the favored mutation in a positive selective sweep.

Author: A Ferrer-Admetlla
Ali Akbari
Arya Iranmehr
BF Voight
C Heffelfinger
CD Campbell
DR Schrider
DR Zerbino
G Coop
G Ewing
H Chen
J Ohashi
JJ Vitti
Joseph J Vitti
KJ Galinsky
M DeGiorgio
M Pybus
M Wang
MC Cornelis
MD Shriver
Mehrdad Bakhtiari
MI Jensen-Seaman
MW Nachman
NR Garud
P Azad
P Pavlidis
Pardis C Sabeti
PC Sabeti
PC Sabeti
PC Sabeti
R Nielsen
R Ronen
R Ronen
S Beleza
S Fan
S Gravel
S Wilde
SA Tishkoff
Siavash Mirarab
SR Grossman
T Stobdan
Vineet Bafna
Y Field
Y Kim
ZA Szpiech
Publication venue: eScholarship, University of California
Publication date: 01/04/2018
Field of study

Most approaches that capture signatures of selective sweeps in population genomics data do not identify the specific mutation favored by selection. We present iSAFE (for "integrated selection of allele favored by evolution"), a method that enables researchers to accurately pinpoint the favored mutation in a large region (∼5 Mbp) by using a statistic derived solely from population genetics signals. iSAFE does not require knowledge of demography, the phenotype under selection, or functional annotations of mutations

Crossref

eScholarship - University of California

Ultra-large alignments using phylogeny-aware profiles

Author: A Stamatakis
C Daskalakis
CX Chan
CX Chan
DA Morrison
DJ Zwickl
DT Jones
EP Nawrocki
F Morcos
F Sievers
GB Gloor
GR Reeck
J Stoye
JA Cuff
JD Thompson
JJ Cannone
K Katoh
K Liu
K Liu
K Liu
K Mizuguchi
Keerthana Kumar
MN Price
Nam-phuong D. Nguyen
RC Edgar
RD Finn
S Mirarab
S Mirarab
S Mirarab
S Nelesen
Siavash Mirarab
SR Eddy
Tandy Warnow
W Fletcher
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0

Author: Asnicar F.
Beghini F.
Bolzan M.
Cumbo F.
Huttenhower C.
Knight R.
Kopylova E.
Manara S.
Manghi P.
May U.
Mengoni C.
Mirarab S.
Pasolli E.
Sanders J. G.
Segata N.
Thomas A. M.
Zhu Q.
Zolfo M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Microbial genomes are available at an ever-increasing pace, as cultivation and sequencing become cheaper and obtaining metagenome-assembled genomes (MAGs) becomes more effective. Phylogenetic placement methods to contextualize hundreds of thousands of genomes must thus be efficiently scalable and sensitive from closely related strains to divergent phyla. We present PhyloPhlAn 3.0, an accurate, rapid, and easy-to-use method for large-scale microbial genome characterization and phylogenetic analysis at multiple levels of resolution. PhyloPhlAn 3.0 can assign genomes from isolate sequencing or MAGs to species-level genome bins built from >230,000 publically available sequences. For individual clades of interest, it reconstructs strain-level phylogenies from among the closest species using clade-specific maximally informative markers. At the other extreme of resolution, it scales to large phylogenies comprising >17,000 microbial species. Examples including Staphylococcus aureus isolates, gut metagenomes, and meta-analyses demonstrate the ability of PhyloPhlAn 3.0 to support genomic and metagenomic analyses

Archivio della ricerca - Università degli studi di Napoli Federico II

High diversity of picornaviruses in rats from different continents revealed by deep sequencing

Author: Aljofan M
Altschul SF
Altschul SF
Benson DA
Bernhart SH
Boisvert S
Chopra G
Coghlan ML
de Groot RJ
Drexler JF
Drexler JF
Easterbrook JD
Edgar RC
Edgar RC
Firth C
Friis-Nielsen J
Gatherer D
Geng H
Griffiths-Jones S
Günther S
Hahn H
Himsworth CG
Holtz LR
Honkavuori KS
Hugh-Jones ME
Hunter AA
Huson DH
Jirintai S
Jones MS
Koressaar T
Ksiazek TG
Kurtz S
Langmead B
Li H
Lindgreen S
Maurice H
Meerburg BG
Mirarab S
Mirarab S
Murray DC
Ng TFF
Nielsen ACY
Oleszak EL
Palacios G
Phan TG
Phan TG
Pickett BE
Punta M
Rice P
Sachsenröder J
Schein MW
Schäffer AA
Spyrou V
Stamatakis A
Taberlet P
Tapparel C
Taylor PG
Truong QL
Victoria JG
Will S
Wolf S
Yu J-M
Zeale MRK
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus norvegicus (R. norvegicus) is a known reservoir for important zoonotic pathogens. Transmission may be direct via contact with the animal, for example, through exposure to its faecal matter, or indirectly mediated by arthropod vectors. Here we investigated the viral content in rat faecal matter (n=29) collected from two continents by analyzing 2.2 billion next-generation sequencing reads derived from both DNA and RNA. Among other virus families, we found sequences from members of the Picornaviridae to be abundant in the microbiome of all the samples. Here we describe the diversity of the picornavirus-like contigs including near-full-length genomes closely related to the Boone cardiovirus and Theiler's encephalomyelitis virus. From this study, we conclude that picornaviruses within R. norvegicus are more diverse than previously recognized. The virome of R. norvegicus should be investigated further to assess the full potential for zoonotic virus transmission

Crossref

Directory of Open Access Journals

Copenhagen University Research Information System

PubMed Central

espace@Curtin

Online Research Database In Technology

An analytical upper bound on the number of loci required for all splits of a species tree to appear in a set of gene trees

Author: B Rannala
C Ané
C Than
C-I Wu
CG Schrago
E Milot
E Mossel
EM Jewett
ES Allman
ES Allman
F Bokma
G Dasarathy
J Heled
JA Rice
JH Degnan
JH Degnan
JH Degnan
JH Degnan
JH Degnan
JH Degnan
L Liu
L Liu
L Liu
L Liu
Lawrence H. Uricchio
M DeGiorgio
MT Hallett
NA Rosenberg
NA Rosenberg
NA Rosenberg
Noah A. Rosenberg
P Pamilo
PJ Cock
R Mehta
S Mirarab
S Mirarab
S Roch
S Tavaré
T Stadler
T Stadler
Tandy Warnow
Y Wu
Y Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

PASTA: Ultra-Large Multiple Sequence Alignment

Author: A. Stamatakis
F. Matsen
F. Sievers
K. Katoh
K. Katoh
K. Liu
K. Liu
M.A. Larkin
M.A. Suchard
R. Finn
R.C. Edgar
R.C. Edgar
S. Eddy
S. Mirarab
S. Nelesen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

In this paper, we introduce a new and highly scalable algorithm, PASTA, for large-scale multiple sequence alignment estimation. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy of the leading alignment methods on large datasets, and is able to analyze much larger datasets than the current methods. We also show that trees estimated on PASTA alignments are highly accurate – slightly better than SATe ́ trees, but with substantial improvements rela-tive to other methods. Finally, PASTA is very fast, highly parallelizable, and requires relatively little memory

CiteSeerX

Crossref

A roadmap for global synthesis of the plant tree of life

Author: Antonelli Alexandre
Baker William J.
Bennett Dominic J.
Botigue Laura R.
Burleigh J. Gordon
Dodsworth Steven
Eiserhardt Wolf L.
Enquist Brian J.
Forest Felix
Kim Jan T.
Kozlov Alexey M.
Leitch Ilia J.
Maitner Brian S.
Mirarab Siavash
Perez-Escobar Oscar A.
Piel William H.
Pokorny Lisa
Rahbek Carsten
Sandel Brody
Smith Stephen A.
Stamatakis Alexandros
Vos Rutger A.
Warnow Tandy
Publication venue: 'Wiley'
Publication date: 01/01/2018
Field of study

Providing science and society with an integrated, up-to-date, high quality, open, reproducible and sustainable plant tree of life would be a huge service that is now coming within reach. However, synthesizing the growing body of DNA sequence data in the public domain and disseminating the trees to a diverse audience are often not straightforward due to numerous informatics barriers. While big synthetic plant phylogenies are being built, they remain static and become quickly outdated as new data are published and tree-building methods improve. Moreover, the body of existing phylogenetic evidence is hard to navigate and access for non-experts. We propose that our community of botanists, tree builders, and informaticians should converge on a modular framework for data integration and phylogenetic analysis, allowing easy collaboration, updating, data sourcing and flexible analyses. With support from major institutions, this pipeline should be re-run at regular intervals, storing trees and their metadata long-term. Providing the trees to a diverse global audience through user-friendly front ends and application development interfaces should also be a priority. Interactive interfaces could be used to solicit user feedback and thus improve data quality and to coordinate the generation of new data. We conclude by outlining a number of steps that we suggest the scientific community should take to achieve global phylogenetic synthesis

Shared Research Repository

Copenhagen University Research Information System

University of Bedfordshire Repository

Deep Blue Documents at the University of Michigan